Pitch-Synchronous Multiresolution Analysis of Music Signals

نویسنده

  • César Alonso Abad
چکیده

In this thesis a novel multiresolution approach for note detection in a polyphonic mix is proposed. The idea is to use a set of wavelets whose lengths are adapted to the theoretical fundamental period of musical notes. Using the typical wavelet dyadic decomposition we can generate a set of wavelets that match the fundamental frequency (F0) of a given note in every octave. Therefore, using a set of 12 different wavelets, one per each semitone, we can represent the fundamental frequency of every note in every octave using one wavelet scale per each octave. The magnitude and phase continuity of wavelet coefficients across temporal frames is exploited to draw a special kind of spectrogram, namely Pitch-Synchronous Wavelet Spectrogram (PSWS). When the corresponding F0 and harmonics of a note are present in the signal, a special DC pattern appears in the PSWS, due to the aforementioned continuity. Any other harmonic signal or noise produces pseudo-periodic or random AC patterns. This way, by filtering the AC components, we can identify the DC patterns in the PSWS and state the presence of a given musical note at some moment in time, even if the signal is polyphonic. For the moment, the method only works satisfactorily when the harmonic peaks in the music signal are close to the theoretical position of the frequencies of the musical notes. Some techniques are suggested in order to improve the system and extend it to non-stable pitch musical instruments.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Classification of Iranian Traditional Music Dastgahs Using Features Based on Pitch Frequency

The Iranian traditional music is composed of seven majors Dastgahs: Chahargah, Homayoun, Mahour, Segah, Shour, Nava, and Rast-Panjgah. In this paper, a new algorithm for the classification of the Iranian traditional music Dastgahs based on pitch frequency is proposed. In this algorithm, the features of Lagrange coefficients of pitch logarithm (LCPL), Fuzzy similarity sets type 2 (FSST2), and th...

متن کامل

Compression of Pseudo-periodic Signals Using 2D Wavelet Transform

An improved method to compress of pseudo-periodic 1-dimensional signals like voiced speech, music, ECG etc is suggested. The pitch synchronous property of such signals is utilized to increase the efficiency of compression, to minimize losses and thus to enhance the quality of the reconstruction. Results show higher signal to noise ratio, higher compression ratio and lower percentage distortion ...

متن کامل

Alias-free, Multiresolution Sinusoidal Modeling for Polyphonic, Wideband Audio

In this paper, we describe an improved method of generating more accurate sinusoidal parameters famplitude, frequency, phaseg from a wideband polyphonic audio source in a multiresolution, nonaliased fashion. This significantly improves upon previous work of sinusoidal modeling that assumes a single-pitched monophonic source, such as speech or an individual musical instrument. In addition to a m...

متن کامل

Pitch synchronous spectral analysis for a pitch dependent recognition of voiced phonemes - PISAR

Humans use the pitch of their conversational partner as an important feature for improving the communication and the understanding especially in noisy situations. This knowledge is taken to investigate the idea of a pitch synchronous spectral analysis and a pitch dependent recognition of voiced speech segments. A first approach is presented for realizing this pitch dependent processing. Its app...

متن کامل

Epoch-Synchronous Overlap-Add (ESOLA) for Time- and Pitch-Scale Modification of Speech Signals

Timeand pitch-scale modifications of speech signals find important applications in speech synthesis, playback systems, voice conversion, learning/hearing aids, etc.. There is a requirement for computationally efficient and real-time implementable algorithms. In this paper, we propose a high quality and computationally efficient timeand pitch-scaling methodology based on the glottal closure inst...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2007